602 research outputs found

    Antennal transcriptome profiles of anopheline mosquitoes reveal human host olfactory specialization in Anopheles gambiae

    Get PDF
    BACKGROUND: Two sibling members of the Anopheles gambiae species complex display notable differences in female blood meal preferences. An. gambiae s.s. has a well-documented preference for feeding upon human hosts, whereas An. quadriannulatus feeds on vertebrate/mammalian hosts, with only opportunistic feeding upon humans. Because mosquito host-seeking behaviors are largely driven by the sensory modality of olfaction, we hypothesized that hallmarks of these divergent host seeking phenotypes will be in evidence within the transcriptome profiles of the antennae, the mosquito's principal chemosensory appendage. RESULTS: To test this hypothesis, we have sequenced antennal mRNA of non-bloodfed females from each species and observed a number of distinct quantitative and qualitative differences in their chemosensory gene repertoires. In both species, these gene families show higher rates of sequence polymorphisms than the overall rates in their respective transcriptomes, with potentially important divergences between the two species. Moreover, quantitative differences in odorant receptor transcript abundances have been used to model potential distinctions in volatile odor receptivity between the two sibling species of anophelines. CONCLUSION: This analysis suggests that the anthropophagic behavior of An. gambiae s.s. reflects the differential distribution of olfactory receptors in the antenna, likely resulting from a co-option and refinement of molecular components common to both species. This study improves our understanding of the molecular evolution of chemoreceptors in closely related anophelines and suggests possible mechanisms that underlie the behavioral distinctions in host seeking that, in part, account for the differential vectorial capacity of these mosquitoes

    Bayesian Inference of Species Trees from Multilocus Data

    Get PDF
    Until recently, it has been common practice for a phylogenetic analysis to use a single gene sequence from a single individual organism as a proxy for an entire species. With technological advances, it is now becoming more common to collect data sets containing multiple gene loci and multiple individuals per species. These data sets often reveal the need to directly model intraspecies polymorphism and incomplete lineage sorting in phylogenetic estimation procedures

    A complementary view on the growth of directory trees

    Full text link
    Trees are a special sub-class of networks with unique properties, such as the level distribution which has often been overlooked. We analyse a general tree growth model proposed by Klemm {\em et. al.} (2005) to explain the growth of user-generated directory structures in computers. The model has a single parameter qq which interpolates between preferential attachment and random growth. Our analysis results in three contributions: First, we propose a more efficient estimation method for qq based on the degree distribution, which is one specific representation of the model. Next, we introduce the concept of a level distribution and analytically solve the model for this representation. This allows for an alternative and independent measure of qq. We argue that, to capture real growth processes, the qq estimations from the degree and the level distributions should coincide. Thus, we finally apply both representations to validate the model with synthetically generated tree structures, as well as with collected data of user directories. In the case of real directory structures, we show that qq measured from the level distribution are incompatible with qq measured from the degree distribution. In contrast to this, we find perfect agreement in the case of simulated data. Thus, we conclude that the model is an incomplete description of the growth of real directory structures as it fails to reproduce the level distribution. This insight can be generalised to point out the importance of the level distribution for modeling tree growth.Comment: 16 pages, 7 figure

    Why highly expressed proteins evolve slowly

    Get PDF
    Much recent work has explored molecular and population-genetic constraints on the rate of protein sequence evolution. The best predictor of evolutionary rate is expression level, for reasons which have remained unexplained. Here, we hypothesize that selection to reduce the burden of protein misfolding will favor protein sequences with increased robustness to translational missense errors. Pressure for translational robustness increases with expression level and constrains sequence evolution. Using several sequenced yeast genomes, global expression and protein abundance data, and sets of paralogs traceable to an ancient whole-genome duplication in yeast, we rule out several confounding effects and show that expression level explains roughly half the variation in Saccharomyces cerevisiae protein evolutionary rates. We examine causes for expression's dominant role and find that genome-wide tests favor the translational robustness explanation over existing hypotheses that invoke constraints on function or translational efficiency. Our results suggest that proteins evolve at rates largely unrelated to their functions, and can explain why highly expressed proteins evolve slowly across the tree of life.Comment: 40 pages, 3 figures, with supporting informatio

    Horizontal Transfer and Death of a Fungal Secondary Metabolic Gene Cluster

    Get PDF
    A cluster composed of four structural and two regulatory genes found in several species of the fungal genus Fusarium (class Sordariomycetes) is responsible for the production of the red pigment bikaverin. We discovered that the unrelated fungus Botrytis cinerea (class Leotiomycetes) contains a cluster of five genes that is highly similar in sequence and gene order to the Fusarium bikaverin cluster. Synteny conservation, nucleotide composition, and phylogenetic analyses of the cluster genes indicate that the B. cinerea cluster was acquired via horizontal transfer from a Fusarium donor. Upon or subsequent to the transfer, the B. cinerea gene cluster became inactivated; one of the four structural genes is missing, two others are pseudogenes, and the fourth structural gene shows an accelerated rate of nonsynonymous substitutions along the B. cinerea lineage, consistent with relaxation of selective constraints. Interestingly, the bik4 regulatory gene is still intact and presumably functional, whereas bik5, which is a pathway-specific regulator, also shows a mild but significant acceleration of evolutionary rate along the B. cinerea lineage. This selective preservation of the bik4 regulator suggests that its conservation is due to its likely involvement in other non–bikaverin-related biological processes in B. cinerea. Thus, in addition to novel metabolism, horizontal transfer of wholesale metabolic gene clusters might also be contributing novel regulation

    The Awesome Power of Yeast Evolutionary Genetics: New Genome Sequences and Strain Resources for the Saccharomyces sensu stricto Genus

    Get PDF
    High-quality, well-annotated genome sequences and standardized laboratory strains fuel experimental and evolutionary research. We present improved genome sequences of three species of Saccharomyces sensu stricto yeasts: S. bayanus var. uvarum (CBS 7001), S. kudriavzevii (IFO 1802T and ZP 591), and S. mikatae (IFO 1815T), and describe their comparison to the genomes of S. cerevisiae and S. paradoxus. The new sequences, derived by assembling millions of short DNA sequence reads together with previously published Sanger shotgun reads, have vastly greater long-range continuity and far fewer gaps than the previously available genome sequences. New gene predictions defined a set of 5261 protein-coding orthologs across the five most commonly studied Saccharomyces yeasts, enabling a re-examination of the tempo and mode of yeast gene evolution and improved inferences of species-specific gains and losses. To facilitate experimental investigations, we generated genetically marked, stable haploid strains for all three of these Saccharomyces species. These nearly complete genome sequences and the collection of genetically marked strains provide a valuable toolset for comparative studies of gene function, metabolism, and evolution, and render Saccharomyces sensu stricto the most experimentally tractable model genus. These resources are freely available and accessible through www.SaccharomycesSensuStricto.org

    Functional significance may underlie the taxonomic utility of single amino acid substitutions in conserved proteins

    Get PDF
    We hypothesized that some amino acid substitutions in conserved proteins that are strongly fixed by critical functional roles would show lineage-specific distributions. As an example of an archetypal conserved eukaryotic protein we considered the active site of ß-tubulin. Our analysis identified one amino acid substitution—ß-tubulin F224—which was highly lineage specific. Investigation of ß-tubulin for other phylogenetically restricted amino acids identified several with apparent specificity for well-defined phylogenetic groups. Intriguingly, none showed specificity for “supergroups” other than the unikonts. To understand why, we analysed the ß-tubulin Neighbor-Net and demonstrated a fundamental division between core ß-tubulins (plant-like) and divergent ß-tubulins (animal and fungal). F224 was almost completely restricted to the core ß-tubulins, while divergent ß-tubulins possessed Y224. Thus, our specific example offers insight into the restrictions associated with the co-evolution of ß-tubulin during the radiation of eukaryotes, underlining a fundamental dichotomy between F-type, core ß-tubulins and Y-type, divergent ß-tubulins. More broadly our study provides proof of principle for the taxonomic utility of critical amino acids in the active sites of conserved proteins

    TaxMan: a taxonomic database manager

    Get PDF
    BACKGROUND: Phylogenetic analysis of large, multiple-gene datasets, assembled from public sequence databases, is rapidly becoming a popular way to approach difficult phylogenetic problems. Supermatrices (concatenated multiple sequence alignments of multiple genes) can yield more phylogenetic signal than individual genes. However, manually assembling such datasets for a large taxonomic group is time-consuming and error-prone. Additionally, sequence curation, alignment and assessment of the results of phylogenetic analysis are made particularly difficult by the potential for a given gene in a given species to be unrepresented, or to be represented by multiple or partial sequences. We have developed a software package, TaxMan, that largely automates the processes of sequence acquisition, consensus building, alignment and taxon selection to facilitate this type of phylogenetic study. RESULTS: TaxMan uses freely available tools to allow rapid assembly, storage and analysis of large, aligned DNA and protein sequence datasets for user-defined sets of species and genes. The user provides GenBank format files and a list of gene names and synonyms for the loci to analyse. Sequences are extracted from the GenBank files on the basis of annotation and sequence similarity. Consensus sequences are built automatically. Alignment is carried out (where possible, at the protein level) and aligned sequences are stored in a database. TaxMan can automatically determine the best subset of taxa to examine phylogeny at a given taxonomic level. By using the stored aligned sequences, large concatenated multiple sequence alignments can be generated rapidly for a subset and output in analysis-ready file formats. Trees resulting from phylogenetic analysis can be stored and compared with a reference taxonomy. CONCLUSION: TaxMan allows rapid automated assembly of a multigene datasets of aligned sequences for large taxonomic groups. By extracting sequences on the basis of both annotation and BLAST similarity, it ensures that all available sequence data can be brought to bear on a phylogenetic problem, but remains fast enough to cope with many thousands of records. By automatically assisting in the selection of the best subset of taxa to address a particular phylogenetic problem, TaxMan greatly speeds up the process of generating multiple sequence alignments for phylogenetic analysis. Our results indicate that an automated phylogenetic workbench can be a useful tool when correctly guided by user knowledge

    Exponential distribution of long heart beat intervals during atrial fibrillation and their relevance for white noise behaviour in power spectrum

    Full text link
    The statistical properties of heart beat intervals of 130 long-term surface electrocardiogram recordings during atrial fibrillation (AF) are investigated. We find that the distribution of interbeat intervals exhibits a characteristic exponential tail, which is absent during sinus rhythm, as tested in a corresponding control study with 72 healthy persons. The rate of the exponential decay lies in the range 3-12 Hz and shows diurnal variations. It equals, up to statistical uncertainties, the level of the previously uncovered white noise part in the power spectrum, which is also characteristic for AF. The overall statistical features can be described by decomposing the intervals into two statistically independent times, where the first one is associated with a correlated process with 1/f noise characteristics, while the second one belongs to an uncorrelated process and is responsible for the exponential tail. It is suggested to use the rate of the exponential decay as a further parameter for a better classification of AF and for the medical diagnosis. The relevance of the findings with respect to a general understanding of AF is pointed out
    corecore